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Abstract. For semi-linear wave equations with null form non-linearities on R 3+1 , we exhibit 
an open set of initial data which are allowed to be large in energy spaces, yet we can still obtain 
global solutions in the future. 

We also exhibit a set of localized data for which the corresponding solutions are strongly 
focused, which in geometric terms means that a wave travels along an specific incoming null 
geodesic in such a way that almost all of the energy is confined in a tubular neighborhood of 
the geodesic and almost no energy radiating out of this tubular neighborhood. 



1. Introduction 

In this paper, we study the Cauchy problem for the following semi-linear wave equation on 

n<p = Q(V<p,Vip), (l.i) 



where Q is a null form (see Section 2.2 for definitions) and ip : R 3+1 -> M is a scalar function. 
The data that we will consider for (1.1) will be some specific large data. In fact, the size of the 
data is measured by a large parameter <5 _1 (where S is sufficiently small) in energy spaces. We 
remark that the results of current work can be easily extended to higher dimensions and to a 
system of equations with null form nonlinearities in the obvious way. 

1.1. Earlier Results. We briefly summarize the progress on small data theory for nonlinear 



waves related to equations of type ( 1.1 ). Based on the decay mechanism of the linear waves, we 



know very well about the Cauchy problems for (1.1) on Minkowski space-times R n+ , especially 



for those with small initial data. In dimensions four and higher, since the linear wave decay fast 

3 , 



enough (at least at the rate t~ a ) , the small-data- global-existence type theorems for (1.1) hold for 
generic quadratic nonlinearities (which are not assumed to be null), see the work of Klainerman 
[7j. However, in M 3+1 , the linear waves decay slower and the quadratic nonlinearities control the 
dynamics of the system. In fact, there are quadratic forms Q for which a finite time blow-up 
phenomenon occurs even for arbitrarily small data. This has been shown by F. John in [JJ. 



The main breakthrough in understanding the small-data-global-existence result of (1.1) was 
made by Klainerman [6j by introducing the null condition on the nonlinearities. Under this 
condition, Klainerman and Christodoulou [lj independently proved that small initial data lead 
to global in time classical solutions. Their proofs are different in nature. Klainerman's ap- 
proach makes use of the full conformal symmetries of M 3+1 through the vector fields, while 
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Christodoulou's idea is to use the conformal compactification of IR 3+1 . Nevertheless, their proofs 
rely essentially on the special cancelations of the null form nonlinearities, which are absent for 
generic quadratic nonlinearities. 

The cancelations of null forms has far-reaching implications for other types of hyperbolic equa- 
tions. Although many hyperbolic equations do not in general have a null quadratic form type 
nonlinearity, the estimates for the nonlinear terms follow more or less the similar philosophy: if 
one term behaves badly (i.e., large in some suitable norms in most cases) in the nonlinearities, 
it must be coupled with the a good (i.e., with a much better or smaller estimates) term. Thus, 
we hope that the good terms are strong enough to absorb the large contributions from the 
bad terms. One major application of this idea in general relativity appears in Christodoulou- 
Klainerman's proof of non- linear stability of Minkowski space-time [3]. They observed that a 
bad (decay worse) component of Weyl curvature is always coupled to either a good connection 
coefficient or a good curvature component, thus in most cases the bad components do not really 
affect the long time behavior of the gravitational waves. 

Although all of the aforementioned results require the initial data be sufficiently small, the idea 
of using cancelations from null forms still can be used to handle certain large data problems. We 
shall briefly describe two very recent works on the dynamics of vacuum Einstein field equations 
in general relativity. 

In his seminal work [2], Christodoulou discovered a remarkable mechanism responsible for 
the dynamical formation of black holes. For some carefully chosen initial data (which give an 
open set of the Sobolev space on a outgoing null hypersurface) , called short pulse data in [2], he 
proved that a black hole (more precisely, a trapped surface) can form along the evolution due 
to the focusing of the gravitational waves. Besides its significance in physics, this result is truly 
remarkable from a PDE perspective, because the result is for large data (roughly speaking, small 
data for Einstein equations in general relativity would lead to a space-time close to Minkowski 
space-time. So for small data, we do not expect black holes). One of the key observations used 
repeatedly in [2] is still related to the philosophy of null forms: we do have many bad (large) 
components in the estimates, but all of them must come with good (small) components to make 
the estimates work. 

In [8j, Klainerman and Rodnianski extended and significantly simplified Christodoulou's work. 
A key ingredient in their paper is the relaxed propagation estimates, namely, if one enlarges the 
admissible set of initial conditions, the corresponding propagation estimates are much easier to 
derive. They reduced the number of derivatives needed in the estimates from two derivatives on 
curvature (in Christodoulou's proof) to just one. We should note that the direct consequence 
of the simpler proof of Klainerman-Rodnianski yields results weaker than those obtained by 
Christodoulou. In fact, within those more general initial data set, they can only show long 
time existence results for vacuum Einstein field equations; nevertheless, once such existence re- 
sults are obtained, one can improve them by assuming more on the data, say, consistent with 
Christodoulou's assumptions and then one can derive Christodoulou's results in a straightfor- 
ward manner. 

The results of this paper are strongly motivated by [2] and the proofs are very much inspired 
by [8]. In particular, the choice of initial data will be analogous to the short pulse ansatz in 
[2]; the proof will rely on a relaxed version of energy estimates similar to the relaxation of 
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the propagation estimates in [8]. We also have to mention another work [9] of Klainerman 
and Rodnianski where they managed to localize the data for Einstein equations to show the 
dynamical formation of locally trapped surfaces. One of our main results here concerning the 
strongly focused waves is motivated by this work. Roughly speaking, it asserts that if the wave 
initially concentrates around a given point in a specific way, then it will be confined in a tubular 
neighborhood of an incoming null geodesic and there is only a negligible amount of energy 
dispersing out of this neighborhood. It is precisely in this sense that we say the wave is strongly 
focused. It seems to the authors that this result is new even for linear wave equations. 

1.2. Main Results. We first state the main theorems of the paper: 

Main Theorem 1. For any given positive number Eq > 0, there exists a smooth initial data 
set ((p(°\ipW) for 

n<p = Q(V<p,Vtp), 

( 93 ,^)| t =o = (^ (0) ^ (1) ), (L2) 
where Q is a null form, so that the energy 

Energy = \ [ |V<^°)| 2 + \ip^\ 2 d Xl dx 2 dx 3 > E , 

and this data set leads to a classical smooth solution of the above equation. Moreover, the future 
life-span of the solution is [0,oo). 

Remark 1.1. Moreover, if for k S N we define the k-th order energy of the data as 

Energy ' (fc) (y>< 0) , y>«) = \ J ^ |VV (0) | 2 + N k ~ l ^\ 2 d Xl dx 2 dx 3 , 
we can show that 

Energy^W^^^S-^, 

where 5 is a small positive parameter. We note in passing that the higher order energies can be 
extremely large. 

In the course of proving the above theorem, we will derive two other results which are of 
independent interest. To facilitate the statement of these two results, we introduce a bit of 
notation. 

We review some geometric constructions on Minkowski space M 3+1 . Besides the standard 
coordinates (t,xi,x 2 ,xs), we shall mainly use the null-polar coordinates (u,u,9). We recall the 
definition for null-polar coordinates. Let 



ry £ I ,-y> ^ I ry^ 



be the spatial radius function. Two optical function u and u are defined by 

u = -(t — r) and u=-(t + r). 

The angular argument 9 denotes a point on the unit sphere § 2 Cl 3 . 

The past null infinity I ~ of IR 3+1 can be represented by the collection of past-pointing outgoing 
null lines. Therefore, 1~ is parameterized by (u, 9) G Mx§ 2 . We also use C c to denote the level 
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surface u = c, where c is a constant; similarly, (7 U denotes a level set of u. Their intersection 
C u H C_ u will be a two sphere denoted by Su ;U . 

We illustrate these definitions in the following pictures: 




We remark that, in Section [3] and Section [4j which are the technical heart of the paper, the 
parameter u will be confined in the interval [no, —1], where uq is a large negative number. The 
parameter u is confined in [uo, 5], where 5 is small positive parameter, which will be determined 
later. To simplify our presentation, we will ignore the 9 directions in our pictures. Thus, instead 
of the left picture above, we will adopt the right picture; as such, the sphere S^ u is represented 
by a single point in the picture. 



We use L and L to denote the following future-pointed null vector fields: 

L = dt + d r and L = dt — d r . 

We shall use y to denote the intrinsic covariant derivative on S u u - It is the restriction of the 
usual covariant derivative of (R 3+ , g) to S uu , where g is the standard flat Lorentzian metric on 
M 3+1 . 

As usual, we use 50 (3) to denote the Lie algebra of the rotation group SO (3) which acts on 
M 3+1 in a canonical way. We choose a set of generators {Qi , r^, ^3} of so(3) in the usual way, 
namely, 

d d 

"1 = X2T. X 3 - , 

d d 

1*2 = £3^ Xi- , 

OX\ OX3 

d d 

*h = ^27^ — • 

OX2 OX\ 

In what follows, we shall use ft to denote a generic Oj and use Q 2 to denote a generic operator 
of the form f^fij, and so on. For a given function (j), we use \fl(f>\ to denote the sum Y2i=i 
and use \Q 2 4>\ to denote the sum X^i<i j<3 l^i^i^l) an d so on. 
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We observe that there are two positive constants C\ and C2, such that on Einy Su,u 

we have 

Ci\u\\fcp\ < \Q<p\ < C 2 \u\\fcj)\. 

For the sake of simplicity, we write this inequality as |f2</>| ~ |it||y</>|. The proof is straightfor- 
ward: we first check it on the unit sphere and then we use the scaling to get the factor \u\. In 
general, for a given k E Z>o, we have 

\n k (j)\ ~ \u\ k \f k (/)\. (1.3) 

The geometric picture all the way to null infinities is usually represented by the Penrose 
diagram of (M 3+1 ,g): 



I = future null infinity 



1 = const. !- ■ 



t=0 l 




space-like infinity 



7 = past null infinity 



f = past time-like infinity 



For many situations in the current work, one has to study the Goursat problem of (1.1) 



namely the characteristic problem, instead of the Cauchy problem of (1.1). Therefore, the null 
hypersurfaces are the main geometric objets in sequel. 



Our second main theorem is a semi-global existence result for a Goursat problem of (1.1) 



where the data is described on a part of a virtual null hypersurface, i.e., the past null infinity 
I~ . More precisely, the initial data of (1.1) will be a radiation field given on the subset 



J- = {(u,0) £l~ \u<8}, 

in the asymptotic sense, where 5 is a positive small parameter determined later. Explicitly, the 
datum is given by a smooth function 



: T 



(u, 9) i-)- ip _ 



and we require the solution of ( |1.1[ ) to obey the asymptotic condition, 



(p(u,u,6) ~ -r-r^-00 

M 

1 



(M) + o( ) 



We remark that, for linear waves, , — r is the expected decay rate towards past null infinity. We 

|u| 

also remark that, when we speak about the smallness or largeness of the data, we always mean 
the smallness or largeness of the radiation filed 92-00 instead of ip itself (which vanishes onl"). 
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In this work, we require the initial data tp-oo to have the following form, 

if u < 0, 

^M)H,k,^ m if0 < M ^ (i.4) 

where ipo : (0, 1) x § 2 — > R is a fixed compactly supported smooth function. More generally, we 
can put c^-oo in an open set of certain Sobolev spaces defined onl". We do not pursue this 
point at the moment and we shall revisit it at the last section of the paper. 

The datum given in the above form is called a short pulse, a name coined by Christodoulou in 
[2] . In his work, he prescribes the shear (more precisely, the conformal geometry) of the initial 
null hypersurface in a similar form. The shear in the situation of [2] is exactly the initial data 
for Einstein vacuum equation. 



One may argue that the datum (1.4) is small when 5 is small, at least pointwisely. In fact, 



the L°° norm is irrelevant to the equation (1.1) since we may always add a constant to get a 



new solution. The size of the datum should be measured at the level of derivatives. While the 
d u derivative of the datum can be extremely large if 5 is small. In sequel, we shall see that the 
energy of cp will be comparable to 1 and the higher order energy of 92 will be comparable to some 
5~ a with a > 0. Therefore, the datum is no longer small in energy spaces. 

Before the statement of the second main theorem, we recall that the domain of dependence 
T> + (Ij) oil J is the backwards light-cone in M 3+1 with vertex at (<5, 0, 0, 0). 

Main Theorem 2. For the non-linear wave equation 

n<p = Q(V<p,V<p), 

where Q is a null form, with the following asymptotic characteristic initial datum on the out- 
going null infinity I ~ ; 

lim \u\(p(u,u,9) = ip- OQ (u,9) for all (u, 9), (1-5) 

u— >oo 

where (f-oo £ C°°(IJ) is given by 



f~oo{u,9) 



'0, ifu<0, 
1 u 

J2^ o (=0), if0<u<5. 



Here -00 : (0, 1) x S 2 — > R is a fixed compactly supported smooth function. If S is sufficiently 
small, there exists one and only one classical solution ip on V + (1J) n {t < -1}, such that the 



radiation field of ip is exactly (p^, i.e. as described in (1.5) 



The theorem can also be depicted as follows (notice that the region T> + (I s ) n {t < —1} is 
enclosed by four red lines): 



LARGE DATA REGIME FOR NLW 



7 



t = -1 




I = future null infinity 
i c = space -like infinity 



r40 



y ro = the radiation field 



J = past null infinity 



i = past time-like infinity 



In the course of the proof of Main Theorem 2, we shall see that the energy flux through C_$ 
(if fact, the flux will be bounded above by 5 a for some a > 0). Thus, almost all of the energy is 
confined in D + (Ij)n{t < — 1} and there is very little energy radiates to the future null infinity. 
Intuitively, the waves travel from past null infinity in the incoming direction all the way up to 
the finite time t = — 1 with almost no loss of energy. This manifests a strong focusing effect of 
the nonlinear wave for a special choice of data. 



We now turn to the last main theorem of the paper where the data are prescribed on a fixed 
finite null hypersurface C UQ instead of on past null infinity. Only for this third main theorem, 
we can fix a finite uq < —2, say no = —10. 

The initial out-going null hypersurface is 



"0 



{(«, u, 9)\uq < u < 5}, 



and we also require the initial data ip uo to behave like a pulse: 



fu (u, 6) 



0, 



if uo < u < 0, 



A 2 fi 

^/>o(=,0), if0<u<5. 
\uq\ d 



;i.6) 



Beside the above requirement, we impose the condition that ipo is localized for the angular 
argument 9, i.e., there is a fixed angle #o € § 2 , so that the support of ipo is contained in a 
geodesic ball B i (8q) centered at #o of radius 5? on S 2 . Moreover, we can require tpQ satisfy the 

2 

following estimates on S 2 , 

4 

h— 1 
2 



^^||f fe V0|Uoo (S2) < 1, 



fe=l 



where y is the covariant derivative for the standard metric on S 2 . We call such initial datum 
ip UQ (u,9) a short pulse localized in B i (#o)- 

o 2 
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Main Theorem 3. Consider the non-linear wave equation 

ntp = Q(V<p,V<p), 

with characteristic initial data given by a short pulse data localized in B i(#o)- If $ i> s small 
enough, there exists one and only one classical solution ip on V + (C Uo ) n {u < — 1}. 

Moreover, if one uses C° = ux [0, 5] x B i (6q) to denote a small open set on the outgoing null 
hyper surf ace C u , the energy is almost localized in U ue [ U0 _i]C° which is a tubular neighborhood 
of some incoming null geodesic parameterized by u £ [uq,— 1] with fixed u and 8 = 9q. More 
precisely, for all u G [uq, — 1], the incoming energy outside the small neighborhood C°, is bounded 
as follows 

f o \Lv\ 2 + \fv\ 2 <o\ 
J c u — c° 

And the energy inside C° is almost conserved, 

| / li^f + lyVI 2 - f \L<p\ 2 + \ytp\ 2 \<5. 
Jc u Jc° 



The above theorem can also be depicted as follows 




We can also show that there is almost no energy radiating out through the incoming null 
hypersurface C s . Quantitatively, we have 

/ \L^\ 2 + \f^\ 2 <S. 

The above estimates are also true for higher order flux and this will be clear later. 

In the proof, we shall see that we can prove a stronger version which we do have L°° control on 
all the first derivatives of ip. On the final outgoing null hypersurface C_i, the energy is mostly 
contributed by Lip and the other components are small (measured in terms of a positive power of 
5), in fact, we can show that Lip is almost unchanged along the evolution, i.e., \Lip(— 1, u, 6)\ ~ 
\Lip(uo,u,9)\. More precisely, 

\Lip(-l,u,9) - Lip(u ,u,e)\ < <5i 



LARGE DATA REGIME FOR NLW 



9 



Since we take a short pulse data localized in B 1 (9q), roughly speaking, we have 



Lip(u ,u,6) 



>3, fox 9 € B gh (e ), 

0, for 6<£B s i{e ). 



Thus, for the final surface C_i, roughly speaking, we actually have 

Lip(-l.u,9) ~ < i 52 

\*5, for^Bi(^ ). 

We then integrate those L°° estimates to derive the desired control on energy. 

Therefore, we have a concentration phenomenon for the a special class of solutions of (1.1) 
and we say that the solution constructed out from the short pulse data localized in some small 
spherical sector are strongly focused. As we remarked before, the third theorem appears to be 
new even for linear wave equations. 

1.3. Comments on the Proof. We would like to address now the motivations for and diffi- 
culties in various estimates leading to the theorem, and then we give an outline of the proof. 

We first explain the idea of the relaxation for the energy estimates. This is done in Section [4] 



which provides an a priori estimate up to four derivatives for (1.1) with short pulse initial data 
It is the key ingredient for the whole paper. 

The proof is based on the usual vector field method to obtain these estimates. We compute the 
amplitudes of Lip and ^ip on C UQ where the data are given. Roughly speaking, the quantitative 
estimates look like 

ll?ylU°°(c U0 ) ~ <^ 2 - 

When one derives energy estimates for <p, a natural choice of the multiplier vector field would 
be L, thus, the energy flux would be 



f \l^\ 2 + f lyvp. (ii 

J Cu " Cu_ 

If we stick to (1.7) as an ideal propagation estimates without any relaxation, we expect 

•J C u 

I \fip\ 2 ~ 5. 
Jc. 



Therefore, the flux term (1.8) will only yield the bound for L<p and not for y/ip, because the 
bound for y/ip is too small compared to that of L<p estimated in this way. In other words, in 
the end, we do not expect to close the argument if we perform a standard bootstrap argument. 
Of course, this is due to the choice of the initial data: the short pulse data do not respect the 
natural scaling of the wave equation! 
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To resolve this difficulty, one has to relax the estimates for ^ip, namely, although the size of 
the initial data suggest the amplitude J7ip behaves like 5%, we pretend it amplitude is worse to 
match (1.7). Therefore, in the energy estimates, we only expect ^ip to behaves like 

\\W\l^(c v ) ~ 1. (1-9) 

It turns out that, under this weak assumption, we can still close a bootstrap argument to derive 
energy estimates. Moreover, once the bootstrap argument is closed, if we can afford one more 



derivative, we can retrieve the stronger estimates (1.7) for "flip. This will be proved later. 



The second difficulty is about the number of derivatives needed for the a priori estimates. 
Instead of four derivatives, we may attempt to use three derivatives in Section |4j since this is 
still good to control L°° norm of one derivatives via Sobolev inequalities. This does not work in 
an obvious way and the reason is as follows: when one derives estimates for third derivatives, 
we can use the information already obtained for the first and second derivatives, but Lip (the 
worst term) still consists a third derivative term. Thus, we can not reduce the nonlinear term 
to a linear one. But this is completely different if we use four derivatives: when one tries to 
control fourth derivatives of the solution, we must have already obtained estimates up to three 
derivatives. Thus, the control of Lip is then independent of the fourth derivatives. Hence, this 
reduces the nonlinear term to the linear case where the Gronwall's inequality can used to absorb 
all the bad terms. 

We also point out that the second difficulty is also related to the relaxation of the energy 
estimates. If we use only three derivatives, for some null forms, say Qoj, it leads to a nonlinear 
term of the form ^ip • Lip. As we commented in last paragraph, we do not have linear control 
on the L°° norm of Lip, and yet because we use a relaxed estimates, the control of ip is not 
good enough to compensate for the large amplitude of Lip. This will lead to a large nonlinear 
term which can not be controlled. 

We now outline the proof of our Main Theorem 1. The parameter uq is a large negative 
number which will be eventually sent to -co. The following picture helps to understand the 
structure of the proof: 



r = -l 




a — » — x. eventually 
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• Step 1. We prescribe initial datum on the null hypersurface C UQ where uq < u < 5. 
When no < u < 0, the datum is trivial, therefore the solution in Region 1 in the picture 
is a constant map. When < u < 5, the datum will be prescribed in a specific form (see 
Section [3] for detailed account): 

1 11 

<p(u,uo,0) = 52ip ( = ,6), 

with the energy approximately equal to Eq. We then show that we can construct a 
solution in Region 2 in the picture. 

• Step 2. From the first step, we actually can show that the restriction of the solution 
already constructed to C s are small in energy norms. On C+, which is defined by 

C+={ P eC+\5<u(p),t(p)<-l}, 

we extend the datum (from Step 1) by zero. Therefore, the datum is also small on . 
We can now use C 5 and as initial hypersurfaces to solve a small data problem to 
construct solution in Region 3 in the picture. 

• Step 3. We patch the solutions in Regions 1, 2 and 3 to get one single solution in above 
picture and then restrict it on the surface S„ . We then let uo go to —00 and use the 
Arzela-Ascoli lemma to get a solution all the way up to past null infinity. The restrictions 
to X uo then yields a subsequence which converges to a Cauchy data. Finally, we can 
reverse and shift the time to complete the proof of Main Theorem 1. 

We remark that Step 1 is the most difficult part since the datum is no longer small and we 
have to carefully deal with the cancelations from null forms and the profile of the data. Steps 2 
and 3 are more or less standard. 



2. Preliminaries 

2.1. Energy Estimates Scheme. Let (ft be a solution for the following non- homogenous wave 
equation on M 3+1 , i.e., 

= (2.1) 
We define the energy momentum tensor associated to (ft to be 

T a p[cft] = V a cftVp<ft - \g a pV^cftV»cft. 

This tensor is symmetric and it enjoys the following divergence identity, 

V a T a p[cft\ = $V^. (2.2) 

Given a vector field X, which is usually called a multiplier vector field, the associated energy 
currents are defined as follows 

J^[<f>)=T^[<ft]X^ 
K X [4>] =T^[cft] 
where the deformation tensor ^tt^ u is defined by 

(X) k„v = \cxg» v = \{^^X V + V U X„). (2.3) 
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Thanks to (2.2), we have 

V Q J^M = K x + X<p (2.4) 
In the null frame {ei, e2, e% = L, = L}, we compute TT Q( g[0] as 

T(L,L)[0] = |L0| 2 , 
T(L,L)[0] = |V70| 2 , 
T(4L)[0] = |L0| 2 . 

This manifests the dominant energy condition for T aj g[0]. 

We shall use X = so (3)), L and L as mutiplier vector fields, the corresponding deforma- 
tion tensors and currents are computed as follows, 

2 , (t.\ 2 , 



(n) 



7T 



n 



0, 



(i) 



0, K 



7T 
L 



7T 



1 



~-L<j)L<t), K L 



1 



(2.5) 



-Lcf)L4>. 



where <j is the restriction of the Minkowski metric m on a two sphere 5 U)U . 

We use u) to denote the space-time slab en- 
closed by the hypersurfaces C Uo , C , C u and C_ u . 
We integrate ( 2.4 ) on T>(u, u), which is a domain 
enclosed by the null hypersurfaces C U ,C_ U ,C UQ 
and C , to derive 




T[<f>](X,L) + / T[<f>](X,L) 



T[cf>](X,L)+ / T[cf>}(X,L) 



+ 



K x [(/)] + <5>-X(j). 



D(u,u) 



where L and L are corresponding normals of the 
null hypersurfaces C u and C_ u . 
In applications, the data on C is always vanishing, thus, we have the following fundamental 
energy identity, 

/ T[<j>]{X,L)+ I T[<p](X,L)= [ T[4>](X,L)+ [[ K x [4>] + $ • X<j>. (2.6) 

JCu JC^ JCu Q JJV(u,u) 

2.2. Null Forms. For a real valued quadratic form Q defined on M 3+1 , it is called a null form 
if for all null vector £ G M 3+1 , we have £) = 0. 
We list seven obvious examples of null forms (a ^ /?), 

where £, 77 € M 3+1 and a,/3 S {0,1,2,3}. In fact, we can easily show that the space of all null 
forms on M 3+1 is a real vector space and its dimension is 7. The above seven quadratic forms 
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yield a basis for the space of null forms. Thus for an arbitrary null form Q, it can be written as 
a M-linear combination of basic null forms in (2.7). 

Given two scalar function <j>, ip and a null form Q(£, rj) = Q a ^ £ Q rip, the expression Q(X7(f>, \7ifj) 
means Q(V0, V^) = Q a P d a (/> dpip . 

For a given rotational Killing vector field f2 S so (3), we have 

nQ(V<f>, Vip) = QiVUcj), VtP) + Q(V0, V^V) + Q(V0, W>), (2.8) 



where Q is another null form. It suffices to check this claim on the basic null forms in (2.7). In 
fact, for Q = Oy, one can check immediately that, 

ttijQo(V(p,Vip) = Q (Vn ij( f>, W) + Qo(V^, Vikjif}), 

nijQapiV&V^) = Q a /3(VQij(j), Vip) + QapiVfaVSUrt) + Q(V(f>,Vip), 

where 

Q = SiaQjP — 5j a Qi(3 + ^jpQia ~ ^ipQja- 

For a vector field X, we denote 

(Q o X)(V0, VV) = Q(VX0, VV) + Q(V0, vxv>), 
and [Q, X] = XQ — Q o X, we then have 

r 

[L,Q o ](V0,V-0) = --(Qo(V0,VVO + £^> + £W), 
r 

[X,Qy](V^,V^) = -Q^(V^,V^) + ^{(^-^)%'i/' + (^-^)^>}, 

[L, Q y ](V0, V^) = -~Qtf(V& V^O + ^{(^ " £0)0^ + - 

[L,Q Oi ](V0,V^) = -Qoi-^iL^-L^), 
r lr z 

[L, Q w ](V0,V^) = -*Qoi - ^(L^-L^). 
Schematically, we write the above as 

[L, Q] (V0, W) = 1 [Q(V0, WO + + LfiLip + 

r 

+ lv>W + + Wft] , 

(2.9) 

[L, Q] (V0, W) = - [Q(V0, VV) + + + 

r 

+ Lipfcp + L</>yv + ■ 

Besides the above algebraic properties of null forms, from the analytic point of view, in a null 
form Q(V(f), V^O, a 6aa! component is always coupled to a f/ooci component. To make a precise 
statement, we remark that in this paper, all the derivatives of tp involving the outgoing direction 
L (for example Lip and Uflip) are bad since in L°° norm, their size is comparable to 5~? which 
is large and it is in this sense they are bad components; for other derivatives, their size is at 
least as good as 6* which is small and it is in this sense they are good components. To see why 
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a bad component is coupled to a good component, we use null frame {ei, e2, e3 = L, e4 = L} to 
express the null form as follows, 

Q(V(f>, VV) = Q 43 Lcf> Lip + Q 34 Lip + Q 4a Lc/> ^ + Q 3a U> f a i/> (2.10) 

+ Q ai fJL^ + Q a4 fj Lip + Q ab fj 



Once again, to prove (2.10), it suffices to check for basic null forms in (2.7). 

In particular, ( 2.10 ) shows that Lip-Lip is forbidden which is a product of two bad components. 
We also observe that the coefficients appearing in (2.10) are bounded by a universal constant. 
Therefore, in applications, we shall bound the null form pointwisely as follows, 

|Q(V0, WOI < \L<t>\ \m + \Lff,\ \I4\ + \f4>\ \fip\ (2.11) 

+ (\L<p\ + \m)\y^ + WMLM + \m)- 

2.3. Sobolev and Gronwall's Inequalities. We first recall Sobolev inequalities on C u , C u 
and S UiU . For any real valued function <p and q > — we have 



i i i i 



M*\\<t>\\ LHS& , u) < \\mh {Cu) iu\\h (Cu) + wnn\\b<pj* 
h 9 u\\l H s^) < \u mn LHS ^ o) + iiiui^iii^ciiitirviii^) + ni-wiii^,), 



i 



The proof is based on the standard isoperimeric inequality on unit sphere. We refer the reader 
to [2] for a proof. We remark that in the first inequality, we assume that (p = on C . This 
assumption is always valid when we apply the inequality in this paper. We also need a variant 
of the second inequality. In fact, Take q = 0, we derive 



i ii 



\ms^) < u\\lhs &uo ) + ii^iii 2 (cj(iii^rviii 2( cj + \\n\\i m j- ( 2 - 12 ) 

Next, we recall the standard Gronwall's inequality. Let 4>{t) be a non- negative function defined 
on an interval I with initial point to- If 4> satisfies the following ordinary differential inequality, 

— (f) <a-(p + b, 

where two non-negative functions a, b £ L l (I), then for all i £ /, we have the following estimates, 

Ht)<e A(t) mo)+ I e- A ^b(r)dr), 
J t 

where A(t) = f tQ a{r)dr. The proof is straightforward. And there is another version of Gron- 
wall's inequality |8J, which will be useful in the proof. 

Lemma 2.1. Let f(x,y),g(x,y) be positive functions defined in the rectangle, < x < xq,0 < 
V < Ho which verify the inequality, 

px i-y 

f(x,y) + g(x,y) < J + a f(x',y)dx' + b g(x,y)dy' 

Jo Jo 

for some nonnegative constants a, b and J. Then, for all < x < xq, < y < yo, 

f(x,y),g(x,y)<Je ax+b y. 
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3. Initial Data for Region 1 and 2 
Let C UQ be a truncated light-cone denned by 

C uo ={p£ R 3+1 \u(p) = n , n < u(p) < 5}, 
and ci° ' 5 ' be a truncated light-cone defined by 

CjM ={ P e R 3+1 \u(p) = n , < u(p) < 5}. 
First of all, we require that the data of (1.1) is trivial on C UQ — Cu^\ i.e., 

<p(x) = 0, for all xeC^-CM. 



Therefore, according to the weak Huygens principle, the solution of (] 1 . 1 p is zero in Region 1, i.e. 
the future domain of dependenc of C UQ — cl° ' 
particular, ip = on C up to infinite order. 



the future domain of dependenc of C UQ — Cu^. That is, ip(x) = if u(x) < and u(x) > uq. In 



Secondly, we prescribe ip on c4° by 

ip(u,u ,6) = - — ^(v,^), (3.1) 
|n | 

where ipo : (0, 1) x §> 2 — > R is a fixed compactly supported smooth function with 1? norm 

approximately Eq. The factor — is natural because it manifests the correct decay for free 

n 

waves. 

The data given in the above form is called a short pulse data, a name invented by Christodoulou 
in [2]. In his work, he prescribes the shear (more precisely, the conformal geometry) of the initial 
null hypersurface in a similar form as ( |3.1| ) . The shear in the situation of [2] is exactly the initial 
data for Einstein vacuum equation. 

We remark that the above data is not a small data in the following sense: the derivative of 
the data can be extremely large if 5 is sufficiently small. In fact, this can be easily observed 
d 

once we take — — derivatives. We also remark that the energy flux of the data is approximately 
ou 

E on Cf/ 1 which is far away from 0. 

In sequel, for most of the computations, we need commutator formulas and we collect all of 
them here. Denote Lie derivatives D = Cl and D = Cl, then 

[Cn,f] = 0, [D,J}=0, [D,f] = 0, 

[□,o] = o, [D,n] = o, [D,n] = o, ^ 
1 2 „ P _, 1 _ 2 „ 



[□,L] = -2(L-L) + -£, p,L] = ^(L-L) 
If we commute O with ( |1.1[ ) n times, with respect to (3.2), we have^] 

□rrv= Q(vo p <^,vov) ; (3-3) 

p+q<n 

1 We shall ignore the numerical constants since they are irrelevant in the context. 
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and these Q's may be different. W commute L,U with (1.1) n times, with respect to (3.2), to 
derive 



p+q<n p+q<n 

- ^(LSTV - LQ n cp) + -Ml n ip. 
We commute L, Q with ( |1.1[ ) n times, with respect to ( |3.2[ ), to derive 

= ^ Q{VLQ p (j), Vfi«$ + ^ [L,Q](Vfi p 0,VfiV) 

p+g<n p+q<n 

+ -^(LSTV - L0» - 



(3.4) 



(3.5) 



We remark, thanks to (2.9), we have the following pointwise estimates which gains a factor in 
it, 



|[L,Q](V0,W)| + |[L,Q](V</>,VV>)| 



< 



rdwii^i + \rm<f>\ + iwii^i + mwm + \mm + 



(3.6) 



We now derive some preliminary estimates for the data on C Uo . In view of (3.1), by taking 
derivatives in L or direction, we have 

\\ Li p\\l^(c U0 ) < S'^Uq 1 , 
In fact, by taking L or y derivatives consecutively, for k £ Z>o, we obtain immediately, 



\\LT<P\ 
\f k+1 p\ 

\L 2 f k p\ 



L°°(C UQ ) ^fc 



<kS 2 U , 



-4„.-fc-l 



L°°(C U0 ) ^fc 



<fc 52 



(in 



-fc-2 



(3.7) 



-fe-i 



With the assistance of the original equation (1.1), one can further derive the L°° estimates for 



derivatives of p involving L directions. For this purpose, we first rewrite (1.1) in terms of null 
frame, namely, 



— LLip + Aip H — (Lp — Lp) 
r 

= 2Q u LpLp + 2Q 3a Lpf a p + 2Q Aa Lpf a p + Q ab f a pf b p. 
To estimate Lip, we observe that ( |3.8| ) can be written as an ODE for Lp as follows, 

L{Lp) = a ■ Lp + b, 



(3.8) 



Since the exact numerical constants are irrelevant, we shall ignore the constants appearing in the coefficients. 
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where 



(-+2Q M Lp + 2Q" a f a p), 

Up + /fa- 2Q Aa Lpf a p - Q ab f a p%cp. 



According to (3.7), we have, 



|a|U<»(C K) ) £ 6 2 «(A 
ll b IU°°(C« ) 



* . 
We also have 

L|Ly>| < \L(Lp)\ < \a\ ■ \L(p\ + \b\. 
In view of the fact that Lp = when u = 0, by Gronwall's inequality (see Section 2.3), we have 



|Ly>(«)| 



< 







>b(r)dr < 5^. 



Hence, 



\\fa\\ L ^{c Uo ) ^ <^u 2 . 



(3.9) 



To estimate Lj7p, we first commute with Q, that is, taking n = 1 in (3.3). In null frame, 
we rewrite the equation as 

—LLflp + Mlp + l (LQp - Lflip) = 2Q 1 {VQ,p, Vp) + Q 2 (V</>, V0). 
r 

We then proceed in a similar way as above, by Gronwall's inequality, we obtain 

\\mp\\ L ^ {CuQ) < 5 1 iu 2 . 



Therefore, according to (1.3), we have 



\Lf<Ph°°(C U0 ) < ^u 3 . 



Similarly, we can commute (1.1) with two and three f2's and obtain 

\\Lf 2 p\\ L ^ (Cuo ) < S^Uq*, 



\\Lf 3 <p\\ L ~ {Cuo) <6*u ! 



(3.10) 



(3.11) 



Remark 3.1 (Key: Relaxation of the Estimates). To obtain existence theorems for (1.1), we 



have to derive certain estimates on p (as well as on its derivatives). Those estimates must 
be valid on the initial hypersurface and they should propagate along the evolution to later null 
hyper surf aces. For this purpose, we shall use a slightly weaker version of estimates for ^ ip than 
those in (3.7), namely, the following estimates 



f k+1 p\\ Lxi(Cuo) < k u 



-k- 



One expects it should be easier to prove the relaxed estimates propagating along the flow of (1.1) 



than the original ones in (3.7). This is the precisely relaxation of the propagation estimates 



mentioned in the introduction. 
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To summarize, on the initial null hypersurface C Uo , under the form of the short pulse data 



(3.1 ), up to fourth derivatives of <p (this is the minimal number of derivatives we need to proceed 
an bootstrap argument, see next section), we have the following relaxed L°° estimates, 



for k = 0, 1, 2, 3, as well as 



\Lf%\ 



fc-1 



7fc+l II <- 

- 1 <u, 



<P\\L°°{C U0 ) 
\\Lf k ip\\ Lac{Cu j 



(3.12) 



L f<p\\ L ^ {Cuo) < 5 5u , 





(3.13) 



For wave equations, we expect the the information propagating along evolution should be 
more or less contained the energies of the solutions, i.e. L 2 norms of derivatives of ip. This 
heuristic leads us to consider the L 2 norms of the data on C uo . 

According to the L°° estimates in (3.12) and (3.13), we obtain immediately the following L 2 
estimates (oberve that the area of C UQ is comparable to 5 ■ Uq), 



\Lf k (p\\ L *(C U0 ) £ 



for k = 0,1, 2, 3, and 



\t + \\\L^ o) <^-\ 



\L 2 <p\\ L °°(C U0 ) ^ $ 1 , 



(3.14) 



\L 2 f v \\ L2{Cuo) < r 1 



L 2 f\\\ LHCuo) < 5 



-l„.-2 



(3.15) 



We the remark that those L 2 estimates are also relaxed estimates. In next section, we shall show 
that, up to a universal constant, the estimates in (3.14) and ( 3.15| ) (the parameter uo will be 
replaced by u) will hold on all later outgoing null hypersurfaces C u where uq < u < — 1 provided 
that the solution of (1.1) can be constructed up to C u . 



4. An A priori Estimates up to Fourth Derivatives 
This section is the technical heart of the paper. We assume that there exists a solution of 



(1.1) defined on the domain T> u ,u which is enclosed by the null hypersurfaces C u , C_ u , C UQ and 
C_q. The goal is to show that estimates (3.14) and (3.15), which are valid on C UQ , also hold on C u . 
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(4.1) 



We slightly abuse the notations: we use C u to denote C u (i.e. yf G [0,n])and C_ u to denote 
Qt M . We now define a family of energy norms as follows, 

Ei(u,u) = \\L(p\\ L 2 {Cu) + &~ 3 \u\?\\f<p\\t2(c u )> 
Ki(u,u) = Wftph^cj +5~ 5 H *\\L<p\\ L 2{cj, 
E 2 (u,u) = \u\\\Lf<p\\ L 2 {Cu) +5- 1 2\u\^\\f 2 i P \\ L 2 {Cu) , 
E 2 (u,u) = \\\u\f 2 <p\\ L2 ^j + 6-2\u\^\\u\Lf(p\\ L 2 { cj, 
E 3 (u,u) = \u\ 2 ^Lf 2 ^\\ L 2 iCu) + 5-^\u\l\\f 3 ip\\ L 2 {Cu) , 
Ea(u,u) = |||n| 2 yVllL2 ( cj + ^^|5||k| 2 LyVllL2(czj, 
E±(u,u) = \u\^Lf\\\ L 2 {Cu) +8-^u\^y\\ L 2 {Cu) , 
E 4 (u,u) = \\\u\ 3 f 4 ip\\ L 2 { cj + S-^u\^\\\ u fLf 3 ^\\ L2{a j. 

We also need another family of norms which involves at least two null derivatives. They are 
defined as follows, 

F 2 (u,u) = 5\\L 2 Lp\\ L 2 {Cu) , 
F 2 (u,u) = \u\z\\L 2 <p\\ L 2 { cj, 

%|||L 2 yV|| L2(Cii) , 

\ufi\\\u\L 2 ftp\\ L 2 { cj, 
5\u\ 2 \\L 2 t^\\ L 2 (Cu)} 



F*(u, u 



F/i(u, u 



(4.2) 



\U\ 2 \\\U 



2 L 2 f 2 ip\\ L2{ c u) . 



We shall prove the following propagation estimates, 



(4.3) 



Main A priori Estimates. If 5 is sufficiently small, for all initial data of (1.1) and all 

I 4 6 R>o which satisfy 

Ex(u ,d) + E 2 (u ,S) + E 3 (u ,d) + E 4 (u ,S) 
+ F 2 (u , 5) + F 3 (u Q , 5) + F 4 (u , 5) < h, 

there is a constant C(I^) depending only on I4 (in particular, not on S and uq), so that 
4 4 
Y,[Ei(u,u) +E l (u,u)} + Y^[F j (u,u) +F ] (u,u)} < C(h). (4.4) 

i=l j=2 

The subindex 4 in I4 denotes the number of derivatives used in the energy norms. 

4.1. Bootstrap Argument. We will perform a standard bootstrap argument to prove the 
Main A priori Estimates. We assume that 

4 4 
Y^WM) +E l {u\u!)] + Y J [F j (u',yf) +F j (u',u')] < M, (4.5) 

i=l j=2 
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for all u' G [«o,«] and u' £ [0,u], where M is a sufficiently large constant. Since we have 
assumed the existence of the solution up to C u and C u , we can always choose such a M which 
may depend on the p. At the end of the current section, we will show that we can actually 
choose M in such a way that it depends only on the norm of the initial data but not the profile 
cp. This will yield the Main A priori Estimates. 



4.2. Preliminary Estimates. Under the bootstrap assumption (4.5), we first derive L°° for 
one derivatives of <p. As a byproduct, we will also obtain the L 4 estimates for two derivatives of 



ip. For this purpose, we shall repeatedly use the Sobolev inequalities stated in Section 2.3 

We start with Lp. According to Sobolev inequalities, we have 

i I I i I 

\u\2\\Lp\\ L 4 {s&u) < \\L 2 p\\l 2{Cu) (\\Lp\\l 2{Cu) + \u\2\\Lf V \\l 2{Cu) ) 

< (5 _1 M)5(M5 + |u|3(|«| _1 M)3). 

Hence, 

\\Lp\\ms^ u )<^\u\-^M. (4.6) 

Similarly, we have 

\u\HLfp\\ L , {Su _ u) < \\L 2 yp\\\ 2{Cu) {\\Lfp\\l 2(Cu) + \u\HLf 2 p\\l 2{Cu) ) 

< ((rvr^^kr 1 ^ + \u\^{\u\' 2 m)^). 

Thus, 

\\Lf<p\\ LHs&u) <5-t\u\-lM. (4.7) 



(4.8) 



i 



Combine (4.6) and ( |4.7| ), we have 

\\L(ph°° < \\Lip\\ L 4 {s&u) + |«|3 \\Lfip\\ L 4 {s&u) 

< 5^\u\- l M. 

We now treat J7p. According to Sobolev inequalities, we have 

Mh?<pu*(Se, u ) < \\ L ?v\\h {Cu )(\\fv\\h(c u ) + \ u \Hfw\h {Cu) ) 

< (|n| _1 M)5((55| n |"5M)5 + \u\^(5^\u\~%M)^). 

Thus, 

lifeline) <^\u\--±M. (4.9) 

Similarly, 

\W\\\l^, u )<^H-Im. (4.10) 



Combine (4.9) and (4.10), we have 



W<P\\l°° < \u\ a 11^11x4(5 ) + \u\*\\¥ 2 <P\\l*(Su,u) 



< 5i\u\~±M. 

It remains to estimate Lcp. For HL^H^, according to ( 2.12[ ), we have 

i i i 

WLfph*^) < \\L<p\\ L 4 iS u,u ) + WL. v>\\h(cj(W\ u \~ Z&Wbugj + \\^¥v\\htc u 



(4.11) 
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Since first three terms are appearing in the bootstrap assumptions (or on the initial hypersurface 
H Uo ), we can control them exactly as before. For the last term, we can restrict the inequality 
on the part of C_ u where the affine parameter u' of L is in the interval [uq, u]. Thus, we have 

\-^Ly(u\u,6)\\ L 2icj < {u^WLifWL^cj- 
The righthand side is again a term in (|4.5|). This allows us to derive 



\\Lp\\ 



Similarly, we can derive 



\L?<p\\l*(Su,«) Z 5±\u\- 2 M. 



We combine (4.12) and (4.13) to derive 

\\L<p\\l™ <^\u\~iM. 



(4.12) 
(4.13) 

(4.14) 



In the same way, we can derive L 4 ,L°° estimates of two derivatives. We summarize all the 
estimates in the following proposition. 



Proposition 4.1. Under the bootstrap assumption (4.5), we have 

1 17 13 

#2 |it|[|L<£>||£oo + 6~4 \u\± ||y^||x,oo + 5~4 \u\ 2 H-Lv'IU 00 
+ 5^\u\ 2 \\Lfp\\ L oc + 5~^\u\^\\f 2 tp\\ L °° + (5"3|ti|l||Lyyj|| LO o < M 

^\u\l\\Lf 2 ip\\ LHs ^ u) + <5-^|i/| J T||y 3 ^|| jL4( ^ ) + 5-i\u\ 3 \\Lf 2 ip\\ LHs ^ u) 
+ 52 \ u \ I \\Lfip\\ LHS&u) + 5~* \u\ J \\Y 2 <p\\ L *(s & u) + s ~ * W 2 \\L? ( p\\lHSu,u) 
+ ^W*\\L<p\\l*(Su,u) + S ~* \ u \~ a Wftph^s^u) + 8~Mu\\\L<p\\ lHSau) < M. 

1 3 

We observe that L°° estimates on Lip (which of order 5j|u| - 2) is certainly worse than the 
initial estimates of Lip on C UQ (which is of order <5a |^ |~ 2 ). To rectify this loss, we derive a 1? 
estimates of Lip on C u (instead of C_ u appearing in the definition of E_\{u, u)). 



Lemma 4.2. Under the bootstrap assumption (4.5), if 5?M is sufficiently small, we have 

\\l<p\\l*(c u ) < %r x M. 



Proof. We multiply Lip on both side of the main equation (3.8) and integrate on C u . In view of 
the fact that Lip = on So, u as well as (2.11), this leads to 

1 



\LpY 



+ 



n r 



-\Lp\\Lp\ + \^p\\Lp\ + \fp\\LpY 



(4.15) 



\Lip\\Lp\ 2 + \L^\\fp\\Lp\ + \fip\ 2 \Lp\, 



c- 
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where the integral J c u means JjfJs , Let / 2 (^) = fc-^—^ 2 ' We now estimate the terms 

at the right hand side of ( 4.15| ) one by one. We have, for the first two terms 



-\Lip\\L<p\ < \u\~ l f(u)M, 



i<p\\L<p\<8*\u\-*f(u)M. 



For the second two terms, we have 



r- 



c- 



For the last two terms, we have 



\f^p\\L^\ 2 <5l\u\-lf{u)M, 
\L<p\\L<p\ 2 < 8- 1 2\u\- 1 f(u)M. 

\L^\\f V \\L V \<5^\u\-lf(u)M, 



Wy\ 2 \Lv\<5±\u\-±f(u)M. 



Back to (4.15), we have 



du 



m^MiS-^u^fiuf + lu^fiu)), 



We then integrate on C u to derive 

\u\ \u\ 
Since 5^M is sufficiently small, this completes the proof 



□ 



4.3. Energy Estimates for Ek(u,u) and E k (u , u) when k < 3. Commute 0* with (1.1), in 
view of (3.3). We use the scheme in Section 2.1 for this equation where we take 4> = ^V>^ = 
0, 1, 2 and X = L. In view of (|2.6|), we have 



m i ip\ 2 + I |yovi 



1«,.|2 



V 



\m l ip\ 2 + jj Q(vn % <f>,v<f>)m % <p 



p+q<i,p£i,crf4 

At this point, we rewrite the above equations as 



Q(vn p ( j),vn q </))Ln i <p + JJ -LnV-^V 



(4.16) 



m l ip\ 2 + R + S + T. 



( u 



where R, S and T are defined in an obvious way. Before deriving the estimates, we remark that 
for any function <j>, we actually have 



\LP(Su, u ) 
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which can be easily derived from (1.3). 

Let us first consider R, i.e. the second integral terms on the right hand side of all equations 



(4.16), in view of (2.11), it splits into 



itl — 


// 1 T ,n\ 1 T O i ,n\ 2 

IJ \LlV\\ l * 1 f\ ) 


R2 = 


/J^\fip\\m^\ 2 , 


R 3 = 


£|L^||LOVII^VI, 


R4 = 


£l^liyovil^vi, 


R 5 = 


U IftpWMFipWLtftp], 


Rq = 


ft {\L<p\ + \f<p\)\fw<p\\Lrt<p 



where i = 0, 1, 2. Now we bound those terms one by one. 
For R\, we have 



Ri< f \\L.<p\\l°°([ \LS?tp\ 2 )du' 



5i\u'\-'iMM 2 du' 

Uo 

< 5i\u\'^M 3 . 

For i?2, we bound J7(p in L°° then proceed exactly as above. This bound is better than R±s and 
we shall use the worse one, 

R 2 < 5^\u\-^M 3 . 

For i?3, we have 



R 3 <( \L^\ 2 \m\\ 2 )H |LfiV 



V 



v 



= (/ \\L V \\ 2 L ^m\\\ 2 L2{Cui) du'Y^j \MLf\\\ 2 L2( c ul) dy!)i 



For i?4, since i < 4, we can use L estimates in Proposition 4.1 This is an easy but important 
observation since we are dealing with terms with lower (less that 4) order derivatives. And for 
the highest order derivative terms, we can not use this approach. We then have (note that we 
bound yVt % ip by L A instead of 1? to gain <5s.) 
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For i?5 and Re, they can be bounded exactly in the same way as R4, thus, 

R 5 <6hu\-lM 3 , 



Rr < 5i\u\-iM 3 . 



We now move to S, i.e. the third integral terms on the right hand side of all equations (4.16). 
A general form for the integrand can be written schematically as 

s = jj |vfi*vi|vftVII^VI- 

Notice that at least one V in this formula is not L. Thus, we can estimate this term exact in 
the same way as we have done for R4. This leads to 

S < dhur^M 3 . 



Finally, for T, i.e. the last integral terms on the right hand side of all equations (4.16). It is 
defined as 

T = jj Jl^VII^VI, 



We bound it by 



T < 



1 

w 



-\\m l ip\\ L 2 iCul) \\m l if\\ L 2 iCul) du' 



'n 



Putting those estimates in (4.16), in view of the size of initial data (3.14) as well as M > 1, 
we obtain 



-,2 , „| 2 



Hence, 



for i = 0, 1,2. 



\m^\\ L 2 (Cu) + ||yoviU 2( aj < h + ^ \u\-*m 



1 3 
2 , 



(4.17) 



Next we switch X to be L. In view of (|2.6|), we have 

ly^vi 2 - 



C U 



i,-|2 



\m l <p\ 



Cur, 



yovr+ // Q(vnv,vy?)xnv 



v 



^ p+<?<*,p<*,<?<* ^ 

for z = 0, 1 and 2. At this point, we rewrite the above equations as 



( u 



yoVI 2 + r+s + t. 



(4.18) 



where R, S and T are defined in an obvious way. 
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We start with R, i.e. the second integral terms on the right hand side of all equations (4.18) 



in view of (2.11), it splits into 



Ri = J^(|L^| + |^|)|LftV| 2 , 

\f(p\ + u^DlLfiVIU^VI, 



R2 

Rs 
R i 



v 



V 



V 



(l^| + |^|)|yovil^Vl, 
Ix^nyoVll^VI- 



We bound those terms one by one. For Ri, in view of Lemma 4.2 we have 



/ (\\Lp\\ L ~ + \\fp\\ L ~)\\m l v\\l 2{ c ul) du' 



< dhu\- 2 M 3 . 



For i?2j R3 and R4, in view of Proposition 4.1, we bound the three factor in the integrands in 
L 4 , L 4 and L 2 , hence, 

R 2 + Rs + R4< <5t|u|~§M 3 . 

We remark that for i?4, it is necessary to bound )P$V"(p in L 4 instead of I? . In this way, one can 
1 

gam an extra 4 . 



We now move to S, i.e. the third integral terms on the right hand side of all equations (4.18). 
A general form for the integrand can be written schematically as 



S = jj |VlF</?l|VftVII^VI 



Notice that at least one V in this formula is not L. Thus, we can estimate this term by bounding 
the three factors in the integrands in L 2 , L 4 and L 4 . This leads to 

s < dhur '^M 3 . 



For T, i.e. the last integral terms on the right hand side of all equations (4.18), 

1, 



T 



(LO'VII^VI, 



v 



we shall use a different approach. This is closely related the so called reductive structure in 
Christodoulou's work [2j. Roughly speaking, at this point, one have to proceed the estimates in 
a correct order and one has to rely the estimates derived in previous steps. This is not transparent 
in the current work, because instead of deriving the estimates in some kind of arbitrary order, 
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the order of the estimates we do here is carefully chosen. We bound T as follows 



T < 



1 



-|LQVII^VI 



< 



v F 
u g 



^|lov| 2 + tI^VI 

r o 



i._i2 



7T2ll Lri VllL2(C u ,) 



The first term in last line has already been controlled in (4.17), so we have 

1 



u § 



T< / (ll + 5z\ u \-2M 3 )du' + 



o 



5\u\- l ll + 5i\u\-2M 3 + 



1 







In view of the size of initial data (|3.14|), we plug the above estimates into (4.18) to derive 

1 



|yOV| 2 + / |L^V| 2 < 5\u\- v ll + 5'i\u\-^M A + 



1 t1 



, r3 



o 



Since HLfiVH^^ ^ also appears on the left hand side, a standard use of GronwalPs inequality 
removes the integral on the right hand side. This yields 

||yfiVlb(C u ) + ll^Vlli^CJ < 55M-5l4 + <fsM~*Mi. (4.19) 

Putting (4.17) and ( |4. 19 ) together, we derive the energy estimates for one derivatives of tp as 
follows, 

E k (u,u) + E k (u,u) </ 4 + <^M~*mI, (4.20) 

for k = 1, 2 and 3. 

4.4. Energy Estimates for Fk(u,u) and F_ k (u,u) when fc = 2,3 and 4. We start with 
FjJ u, jx)'s. By commuting L and with (1.1), in view of (3.4), we use the scheme in Section 



2.1 



for this equation where we take 4> = LQ' t ip with % = 0, 1, 2 and X = L. We then have 



t._|2 



<2„ 



|L^vi + // Q(VLn l (p, V(p)L 2 n l (p 



+ 



JJ Q(vl^,vov)£ 2 ^V + JJ Yl [L,Q](w p <f),vn q <p)L 2 n l ip (4.21 



p+q<i ,p<i,q<i " p+g<* 

1 2 1 



/-p H r r 

At this point, we rewrite the above equations as 



|L 2 OV| 2 



i,„|2 



L 2 QV| 2 + -R + <S + T + {7, 



where R, S, T and £7 are defined in an obvious way. 
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First of all, we consider R, i.e. the second integral term on the right hand side of (4.21), in 
view of (2.11), it splits into 

R x = JJj\Up\ + \f^\)\L 2 ^ip\ 2 , 

R2 = ff (\?<p\ + M)|^nvi|£ 2 ^VI, 

R3 = ff(\y<p\ + \l<p\ + i^Diy^vii^vi- 

We will bound the three factors in these integrands in L°°, L 2 and L 2 respectively. 
For Ri, we simply bound \Lip\ and in L°° and obtain 

Ri ^ <H|«Hm 3 . 



For i?2; we first need 1? estimates on LLfl l ip. According to (3.8), we have 

\\LL(p\\ L 2 {Cu) < \\fi(p\\ L 2 iCu) + \\-Lip\\ L 2 {Cu) + \\-L(p\\ L 2 {Cu) 

+ \\L<pL<p\\ L2{Cu ) + \\L<pf<p\\i^(c tt ) + \\ L 9f^\W{c a ) + \\\ftp\ 2 \\L2(c u ) 
For those quadratic terms, we bound one of them in L°°, thus, we have 

\\LL<p\\*(C u ) < H~ l M. (4.22) 

For LLfflp's with i = 1,2, we can proceed exactly in the same way to derive (we also use Sobolev 
inequalities) 

ii _ r\\L (c u ) ~i i^ ^ 
||LL^-Vlk4 (s ^) <rs|«|-iM. 

Therefore, we bound and |Lc^| in |LL$7*c/?| and |L 2 f2*<^| in L 2 , this yields 

R2 < <H ju^M 3 . 

For i?3, similarly, we have 



Secondly, we consider S, i.e. the third integral term on the right hand side of (4.21). It is 
bounded by the sum of the following terms 

51 = jj + |ynVI)|£ 2 « p ¥>P 2 fiVI, 

5 2 = ^" {\yWip\ + \m q ^\)\Lm p ^\\L 2 Q\i 

s 3 = JJ (\fw<p\ + \m«ip\ + \Ln«v\)\fmPip\\L 2 ni<p\, 

where p + q<i, p<i and q < i. Since the numbers of derivatives in the first factors are not 
saturated, we will bound the three factors in these integrands in L 4 , L 4 and L 2 respectively. 
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For Si, we have 



Si < 



Ju 



< 5*M 



'0 JUQ 

where j = q, q + 1 and we have used the following Sobolev inequalities for last line, 



\L 2 n^\\ LHs ^ u) < \u\ 2\\L 2 niip\\ L 2 (s ^ u) + \u\2\\fL 2 niip\\ L 2 is&u) . 



Thus, we have 



5i <6*M riu'l-illi^^ll^^^lli^VII^^) + k'rill^^+Vll^^oll^VII^^) 

Juq 



< 5"4|ti|-2M 3 . 

For S2 and S3, similarly, we obtain 

S2 + S3 < ^llul^M 3 . 



Thirdly, we consider T, i.e. the fourth integral terms on the right hand side of (4.21). It is 
bounded by 

t = jj ^- (l^vil^VI + iy^l^VI + I^HI^VDI^VI, 

^ P+<7<* 

The strategy is to control the three factors in the integrands either in L°°, 1? and L 2 or in L 4 , 
L 4 and L 2 respectively, depending wether the number of derivatives are saturated or not. We 
omit the details since the proof is exactly the same as for R and S terms. We obtain 



Finally, we consider U, i.e. the last integral term in (4.21), we simply estimate two factors in 
the integrands in L 2 and L 2 . This gives 

U < S^M^M 2 + 5'hu\-^M 2 . 



Putting all the estimates back to (4.21), in view of the size of data (3.15), we have 
! |L 2 OV| 2 + f \fLn l ^\ 2 < b- 2 l\ + 5-i\u\- 1 2M 3 . 



for i = 0, 1 and 2. This is equivalent to say that 



(4.24) 



for k = 2, 3 and 4. 



We now derive the estimates for F k (u,y,ys. Since the proof can almost be converted word by 
word from the proof we just preformed for Fk(u, u)'s, in stead of giving all the details, we only 
sketch the idea. 
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By commuting L, Q, with (1.1), in view of (3.5), one can apply the scheme in Section 2.1 for 
this equation by take <f> = L0 l <^, i = 0, 1, 2, and X = L. Thus, (2.6) reads as 

f \fm l ip\ 2 + I |L 2 ^V| 2 = / \fMl i i P \ 2 + ft Q(VLftV,Vv?)L 2 ftV 

+ 11 J2 Q(VLn p <P,y^ q <P)L 2 n i (p + II [L,Q}(vn p <p,vn q <j))L 2 n l ip (4.25) 

p+q<i,p<i,q<i P+q<i 



/pr r r 

At this point, we rewrite the above equations as 



I \fU} i i P \ 2 + I \L 2 n i ^\ 2 =l IfLWtpf + R + S + T + U, 

J Cu J C J Cur, 



C u —u •* C UQ 

where R, S, T and U are defined in an obvious way. 
For R, we bound it by the sum of the following terms 



R x = jjj\Ltp\ + \f<p\)\L 2 Wv\\ 

R2 = ff(\?<p\ + I^DI^VII^VI, 



Rs = Jljxfvi + \l<p\ + i^oiy^Vii^vi- 

We then bound the three factors in the above integrands in L 2 and L 2 respectively. This 
will yield directly 

R 1 <5 1 2\u\- 2 M 3 , 
R 2 <6%\u\-*M 3 , 

and 

R 3 < 5\u\~ 3 M 3 . 
For S, it is bounded by the sum of the following terms 

Si = jj (|£«vi + \yw<p\)\i?w<p\\!itf(p\ } 
s 2 = jj (|yovi + |£^VI)I^VII^VI, 
s 3 = jj (\fn\\ + \m«ip\ + \LWip\)\fm p tp\\L 2 n l <p\, 

where p + q<i,p<i and q < i. 

Since the numbers of derivatives in the first factors are not saturated, we can bound the three 
factors in these integrands in L 4 , L 4 and L 2 respectively. This yields 

Si <55| n |- 2 M 3 , 

and 



S 2 + S 3 < Ji|u|-fM 3 . 
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For T, typically, there are bounded by 

s 7 =[f±- (\fn p <p\\Ln q <p\ + \fn p <p\\m q <p\ + \m p <p\\m q t P \)\L 2 n i <p\, 

^ P+Q<i 

We control the three factors in the integrands either in L°°, I? and L 2 or in L 4 , L 4 and L 2 
respectively and obtain 

T < d'i\u\~^M 3 . 

For U, we control the two factors in their integrands in 1? and 1? and obtain 

U < S^\ U \- 2 M 2 + 5\u\~^M 2 . 



Putting all those estimates together, in view of (3.11) and the fact that \u\ < 1, we derive 

/ \L 2 n^\ 2 <5 2 \u \- 4 I 2 + 8^\u\- 2 M 3 . 

for % = 0, 1, 2. This is equivalent to say that 

Fk(u) <5\u \^h + 5*\u\-*M%, (4.26) 

for k = 2, 3 and 4. 



4.5. Estimates for E^{u,u) and E 4 (u,u). First of all, we commute Q three times with (1.1), 
that is, taking n = 3 in (3.3), this yields 

□oV= Yl Q(vn*v,vov)- 

p+q<3 



For E^(u,u), we use the scheme in Section 2.1 for this equation by taking (f> = ft 3 ip and 
X = L. In view of (|2.6|), we have 



j \m z tp\ 2 + J \fn 3 <f\ 2 = I \LQ 3 (p\ 2 + II m 3 (pQ(yn 3 (p,v<p) 

+ if LQ 3 <p Q(Vn p ip,Vn q (p) + (I -LQ 3 ip ■ LQ 3 



(4.27) 



ip. 



At this point, we rewrite the above equations as 

I \Ln 3 ip\ 2 +l \fn 3 ip\ 2 =l \m 3 ^\ 2 + r + s + t. 



c u J Q-u J Cu 

where R, S and T are defined in an obvious way. 

We claim that the estimates for S and T are easy and we can bound them by 

5 < 5^\u\-^M 3 , 

and 

T < 5\u\~ 2 M 2 . 

To be more precise: since the numbers of derivatives for the integrands of S are not saturated 
(i.e. only one term has four derivatives), we can bound the three factors in the integrands in 
L 4 , L 4 and L 2 respectively; for T, we simply bound the integrands in two L 2 's by Holder's 
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inequality. The actual proof goes in the same way as in previous the section and we omit the 
details. 

It remains to bound R. This terms can be bounded by the sum of the following terms before, 



R 2 = Jf (\L<p\ + \Y<p\)\W?<p\\LSlPip\, 

R 3 = jj(\f<p\ + \LfP\ + \L<p\)\rn 3 <p\\Ln 3 v\. 

We remark that, because this is about the top order derivative estimates, we have to place 
the last two terms in the above integrands in L 2 and the first one in L°°. In such a way, we can 
easily derive 

R l < 5*\u\-iM 3 , 

and 

R2 < ^|«| _1 M 3 . 

The estimates for R% are more difficult and require the knowledge of all the previous estimates 
derived so far. Before going into the details, we would like to explain how the difficulties 
appear. It is intimately related to the relaxation of the propagation estimates. One may expect 
||yr2 3 (/9||^2( ( 7 i( ) behaves as 5? in view of the initial data. But in reality, because we are using a 

relaxed version of propagation estimates, we lose automatically 82 . Therefore, if we treat R3 in 
the same way as for R\ and R2, we will not get any positive power in 5 and therefore we can 
not close the bootstrap argument. 

To around the difficulties, we recall that in previous sections those L°° estimates (say on 
Lip) are directly derived from the bootstrap assumptions via the Sobolev inequalities. The key 
observation is, if we make use of the estimates derived so far in previous sections instead of the 
bootstrap assumptions, we can indeed improve the L°° estimates for Lip. This improvement will 
be just good enough to enable us to close the argument. 



We first improve the L 4 estimates for LClip, according to (4.20) and Sobolev inequalities, we 
have 

M £ iiljviu^) < \\L 2 n<p\\ \ 2{Cu) {\\l^\\ \ 2{Cu) + \u\ I \\fL^\\l 2(Cu) ) 

< (T3(J| +^M"BMi)(i| +<5ts|u|"5M3) ( 4 - 28 ) 

< (T5(/ 4 + 55|u|-3M§). 

This implies better L°° estimates for Lip, once again via Sobolev inequalities, as follows 

\\L<p\\l°° < \u\-2\\L(p\\ L 4 {S!i>u) + \u\^\\Lfip\\ L 4 {s ^ u) 
< 5~z\u\~\Ia + 8»\u\~iM^). 
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We now proceed to bound i?3 and we only consider the main terms with Lip (the others are 
much easier to control): 

Rs< jj \Lv\ 2 \u'\ 2 \fti^\ 2 + \u'\- 2 \m^\ 2 du'du 

f S 1 1 3 f U 

< / «5- 1 (/ 4 + «58|«r 3 M2) 2 ||yn 3 ¥ j|li2 (c ,)dv!+ I \u'\- 2 \\m 3 <pf L2{c ,) du '- 

JO " J UQ 

Finally, we put all the estimates together, in view of the size of the initial data on C uo , we 
have 

\\Ln 3 <p\\l 2(Cu) + \\fn 3 <p\\l 2(aJ </ 4 2 + jl|n|-5M 3 + / \ u '\- 2 \\m 3 ip\\ 2 L2(Cu/) du' 

- JUQ 

r$ 113 
+ / S~ x {h + 5^u\-iM2) 2 \\fVL 3 (p\\ 2 L2{c .dyf. 
Jo 



Thus, thanks to Gronwall's inequality, 

\u\ 3 \\Lf 3 <p\\ L2{Cu) + \\\u 3 \f 4 v\\LHcj 

< (exp |u| _1 + exp(/ 4 + 5& \u\~^M^) 2 )(h + 5*\u\~*Mi). 
Therefore, if 5 is sufficiently small, we have 

|n| 3 ||LV7V|| L 2 (Cu) + |||n 3 |yVllL2 ( cj < ^4 + ^|n|-2M§. 

Equivalently, 

E±(u,u) < U + 5*\u\~*M*. (4.30) 
This is the desired estimates for E^u, 1 ^). 



For E_4(u,u), we switch X to L. In view of (2.6), we have 



\fn 3 <p\ 2 + f \m 3 ^\ 2 = I \fn 3 <p\ 2 + II m 3 <pQ(vn 3 <p,v<p) 



(4.31) 



At this point, we rewrite the above equations as 

I \fn 3 ^\ 2 +l \m 3 ip\ 2 = I \fn 3 <p\ 2 + R + s + T. 

where R, S and T are defined in an obvious way. 

We claim that the estimates for S are easy since the numbers of derivatives for the integrands 
of S are not saturated. We bound the three factors in the integrands in L , L and L 2 respectively 
and we derive 

5 < 5 r i\u\-lM 3 . 

We can bound R more or less as before. First of all, it is bounded by the sum of the following 
terms 
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R l = JI^\L<p\ + \f<p\)\Ln 3 <p\ 2 , 

R 2 = ft J\IM + \y<p\)\Ltf<p\\MP<p\, 

R 3 = jf(\f<p\ + \£&\ + \Lip\)\fn 3 ip\\m 3 ip\. 



Once again, except for the last term in i?3, all the other terms are easy to control so we ignore 
all the them and assume 



r 3 = II^L<p\\fn 3 <p\\Ln 3 <p\. 



Therefore, we repeat the previous argument and make use the improved L°° estimates for Lip, 
we obtain 

R 3<H \Lip\ 2 \u'\ 2 \L_n 3 ip\ 2 + \u'\- 2 \fn 3 ip\ 2 du'dy! 



< 



[ S-\L 
Jo 



. 1 , , 1 . , 3 



4+ #8 Id 4M2 



^21 



Ml 3 ip\\ 2 L2(c ^du + / \u'\~ 2 \\ftt 3 ip\\ 2 2 ^Adu'. 



For T, thanks to (4.30), we have 



T<5\u\- l ll + 5 ! i\u\-iM 3 + - 5 J \\m 3 ipf^^ 
Putting all the estimates together, we have 

\\f^ip\\l HCu) + \\m 3 ip\\ 2 LHC j < 5\ u \- l il + si \u\~Im 3 + / \u'\- 2 \\fn 3 ipf L2 , c M 



du' . 



' -" 2 du' 



r 

Thus, thanks to Gronwall's inequality, we obtain 

\\fn 3 ip\\ L 2 {Cu ) + \\Ln 3 ip\\ L 2 { c u) < 55\u\-h 4 + 6*\u\-zML 



We then combine (4.30) and (4.32) to conclude 



E A {u) + E 4 (u) < h + 5s\u\-±M2. 



(4.32) 



(4.33) 



4.6. End of the Bootstrap Argument. We add the estimates in previous sections together, 
since \u\ > 1, we derive 



4 4 
i=2 



1 3 
2 



i=i 



By the definition of M from the bootstrap assumption (4.5), we obtain 



M <h + 5*M^. 
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By choosing 5 suitably small depending on the quantities I4, we conclude that, there is a constant 
C(Ia) depending only on I4, such that 

4 4 
Y}Ei{u) + E i (u)\ +J>i(«) + F j (u)\ < C{h). 

i=l j=2 

Therefore, we have completed the proof of Main A priori Estimates. 

4.7. Higher Order Derivative Estimates. For higher order derivative estimates, the argu- 
ment is completely analogous, in fact much simpler, because we have already closed the boot- 
strap argument and we can simply use an induction argument to derive estimates for each order. 
Therefore, we shall omit the detail and only sketch the proof. We introduce a family of energy 
flux norms for higher order derivatives: 

E k (u,u) = \u\ k - 1 \\Lf k - 1 ^\\ L 2 {Cu) + S-^u^-^lf^ll^), 

E k ( u ,u) = lui^iiyVllL^cj + ^^kl^iiiui^y^Vlli^cj, 

for all k > 1. Similar to the lower order derivatives cases, we also need a family of flux norms 
involving at least two null derivatives: 

F k (u,u) = 5\u\ k - 2 \\L 2 f k - 2 v\\ L i iCu) , 
F k (u,u) = \ U \ h 2\\\u\ k - 2 L 2 f k - 2 V \\ L 2 { cj, 

for all k>2. 

To achieve the high order derivative estimates, we will perform an induction argument. The 
beginning cases for the induction argument have already been verified, this is indeed the Main 
A priori Estimates obtained earlier, that is, 

4 4 
Y^[Ei(u,u) + E 4 {u,u)] + Y^[Fj(u,u) + < C(h), 

i=l j=2 

where I4 is the size of the data up to four derivatives. The higher order estimates are formulated 
as follows: 



Proposition 4.3. If 5 is sufficiently small which may depend only on k, for all data of (1.1) 
and all I n +2 £ K>o satisfying 

n+2 n+2 

Y J E l {u^5) + Y,F 3 (u ,5) < I n+2 , (4.34) 

i=l j=2 

there exists a constant C(I n +2) depending only on I n +2 (in particular, not on 5 and uq), so that 
[E n+2 (u,u) + E n+2 (u,u)} + [F n+2 (u,u) + F n+2 (u,u)} < C(J n+2 ), (4.35) 
for all u G [uq, — 1] and u G [0, 5] in the sense of a priori estimates. 

Remark 4.4. The index in I n +2 indicates the number of derivatives needed in the energy. The 
small parameter 5 may depend on n. In applications, since we only need the bound on at least 
10 derivatives on the solutions, we can ignore this dependence. 
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Proof. We now sketch the proof. Once again, we make the following bootstrap assumption: 

[E n+2 (u,u) + E n+2 (u,u)} + [F n+2 (u,u) + F n+2 (u,u)} < M, (4.36) 

for all u and u where M is sufficiently large. 

We proceed as before. First of all, we can derive preliminary estimates for higher order 
derivatives of 92, i.e. the L°° estimates up to n-th order derivatives and L 4 estimates up to 
(n + l)-th order derivatives. Those estimates are simply from the Sobolev inequalities and they 
are listed as follows 

5^\u\ i \\Lf i ~ 1 ip\\ L o +5~3| u |f+ i ||y i (/ 3|| ioo +5~^\u\i\\\u\ i ~ 1 Lf i ~ 1 (p\\ L ° a < M, 

5h\ufi +j \\Lf j tp\\ L 4 {S}iiu) +r*\u\* +j \\f j+1 <p\\ L 4 (Sjb j +5- 1 *\u\ 2 \MLf j ip\\ Li{s ^ u) <M, 

for all i,j £ {2,3, • • • , n}. In fact, based on the induction argument, we know that if i or j 
strictly less than n, we can replace the right hand sides of the above estimates by a constant 
depending only on I n +i instead of M. 

Secondly, we can perform the similar arguments as in previous sections to obtain energy 
estimates, this lead to the following estimates 

F n+2 (u,u) < I n+2 + 5^\u\-^M^, 

F n+2 {u,u) < I n+2 S^ + 5 l s\u\--+M 2 , 

1 13 

E n+2 (u,u) + E n+2 (u,u) < I n+2 + 8s\u\~4M*. 
Therefore, we can complete the proof by taking a sufficiently small 5. □ 

5. Existence of Solutions 
5.1. Existence in Region 2. In this section, based on the a priori estimates in last section, 



we first show that (1.1) with data prescribed on C uo where uq < u < 5 in last sections can be 
solved all the way up the t = —1, i.e. the Region 2. Recall that Region 2 is in the future domain 
of dependence of C_ Q and C uo (with < u < 5) and the data on (7 is completely trivial. 

To start, we use the local existence result [12] of Rendall for semi-linear wave equations for 
characteristic data, we know that there exists a solution around So, m , say, defined in the region 
enclosed by C , C uo and t = uq + e with e « 5. Thanks to the a priori estimates, if at the 
beginning we assume the bound on data for at least 10 derivatives, the L°° norms of at least 
up to 8 derivatives of the solution are bounded by the data on t = uq + e. Therefore, we can 
solve a Cauchy problem with data prescribed on t = uq + e to construct a solution in the future 
domain dependence of t = uq + e whose boundary consists of two null hypersurfaces C Uo +e and 
(7 e . Now we have two characteristic problem: for the first one, the data is prescribed on <2 and 
Cu +e] for the second one, the data is prescribed on C UQ and Q e . We can use Rendall's local 
existence result again to solve them around 5o, Uo + e and S etUo - In this way, we can actually push 
the solution to t = uq + e + e' with another small e' . 

We then can repeat the above process in an obvious way to push the solution all the way 
to t = uq + 5. Similarly, we can then push it from t = uq + 5 to t = —1. Therefore, we have 
constructed a solution in the entire Region 2. We remark that this process depends crucially on 
the a priori estimates since the L°° norms of the derivatives of <p is guaranteed to be bounded. 

Therefore, for a finite uq, a solution has been constructed in Region 2. 
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If we restrict the above solution to Cg, i.e. the future incoming null boundary of Region 2, it 
gives partially the initial data for (1.1) in Region 3. We now give a detailed description of the 
data on C_ s . 

Proposition 5.1. Assume we have bounds on Ei(uo,S) and Fi(uo,5) for i < n + 2 for some 
fixed n > 10 (say n=10). Then, for all p > 1 and q with p + q < n — 1, we have 

||yOVIU~(CJ<^M- 2 , for all u€[0,S\; 
HFOVIIioo^j^^H-f- 1 , for all ue[0,5}; 

Remark 5.2. The smallness on L p {l q (p only holds on the final hypersurface C_g. 

Proof. First of all, by losing one derivatives, we can retrieve a better L 2 (S UtU ) estimates for 
y/Q k ip than in the relaxed propagation estimates. For k < n, let 

h(u,u) 4 iiynVlU^)* 

thus, 

Lh 2 (u,u) = f L{f^ k f) 2 ~- I (?^V) 2 

/<? T Jq 

" >Ju,u " >Ju,u 

< 2\\Lfn k ip\\ L2{s ^ u) ■ h(u,u). 

where the factor — appearing in the second integral is the mean curvature of the incoming 

r 

hypersurface C s . Therefore, we have 



9 , ~~ fc . 



du 

We now integrate to derive 



h(u,u) < \\Lfn K <p\\ L 2 {s&u) . 



f u 1 

h(u, u) < h(u,uo) + / |— ^■|||?x / |Lyf2 fc (/?|| £( 2(5 
iiii) \ u I ~'™ 

< fc(u,«o) + ( / ^^') 5 I|M^ + Vlb ( cj 



<<5i|d- 2 . 



For the last line, we have used the bound on the data on C uo instead of the relaxed propagation 
estimates. It is precisely at this point one retrieves a 6%. 

As a consequence, we also retrieve an improved L 2 (C U ) and L 2 (C_ U ) estimates for ^I£l k (p: 



\fn k ip\\ L2(Cu) = ( / \\fn^\\i 2{Suu) dup < 5\u\-2, 
lyoVlUa^ = ( / \\fn k ip\\ 2 L2{Suu) d u 'p < 5*\u\-\ 

J un 



According to the Sobolev inequalities, we then obtain 
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This proves the first inequality. 

For the second inequality, we simply integrate L(L p+1 Q q ip). To illustrate the idea, we only 
consider the case where q = 0. The other cases can be treated exactly in the same way. We 
commute L with ( |1.1[ ) p times to derive 

L\L p+1 <p\ < \LL p+1 p\ 

< {- r + \L V \ + i^di^+vi + {\ + \l<p\ + + + i-o.t.. (5,2) 

where the lower order terms l.o.t. can be determined inductively. For example, when p = 0, 

l.o.t. = \fi<p\. 

when p = 1 , 

l.o.t. < \Q(VL<p,V<p)\ + l -\Q{V^,V V )\ 
+ \(\L<p\ + \L< f \) + \4LL<p\ + 1 \fi ( p\. 



when p = 2, 



1 _ 1 



l.o.t. < |Q(VLV,V^)| + \Q(VLip,VLip)\ + ^\Q(VLip,V<p)\ + ^\Q(y<p,Vcp)\ 
For p = 0, we have the following estimates 



I l.O.t - 1 1 room \ < <5 2 



.1 



-2 



dm 2 < <53|«r 2 . 

We integrate along L and use Gronwall's inequality to yield the correct estimates for Lip. That 
is 

\Lfp\<83\u\- 2 . 



Moreover, substitute this result into (5.2), it leads to 

\LL<p\ < 5-^\u\- 2 . 
In general, we proceed inductively, assume 

l#VI <s^\u\-p-\ 

\LU^\<5- L i\u\-v- 1 . 
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Then for p + 1 we have 

||l.o.t.|| io0 (cj < <H|u|~ p - 2 , 

ii + 1^1 + \m)LL^\\ L ~ { cj < Hm-^- 2 

\\\yu^\Wv\\\L~ ( cj<^\u\-v- 2 . 

We then integrate along L and use Gronwall's inequality to conclude. This proves the second 
inequality. 

The third inequality is a little bit surprising, since on expects L derivative causes a loss of 
5~2, which can been directly from the special choice of the profile of the initial data. The idea is 
that the loss in 5 only should occur from initial data but not from the energy estimates. Recall 
that the data is given by 

S 2 U 

cp(u,u o ,0) = i — 1^0(7,0), 
\u \ 

where the u-support of ifio is inside (0, 1). Therefore, on C Uo near Sq )Uq , the data is completely 

\\ Li p\\l^(c s ) % **m -1 . 



trivial. In particular, {L l Q, 3 p){uq, 5, 6) = 0. We then integrate (1.1) to get estimates on Cg. 
To illustrate the above idea, we now prove 



We rewrite (1.1) as 



LLip < -\L<p\ + \Up\\Ltp\ + |yV||Ld + l.o.t. 
r 

This can be view as an ODE for L<p on Cg with trivial data on Sg tUQ . We observe that and 
Lip are all of size J2|tt| -2 and the l.o.t. is of size <5 2 1^| — 2 we can integrate above equation and 
use Gronwall's inequality to derive 

II l ^IIl-(c,) ^ ^\u\-\ 

and 

\\LLp\\ Lx{ c s) < 5^\u\~ 2 . 
In the same way, by induction, we deduce 

Wp\\ Lao{ c s )<^\u\-\ 

\\LL^\\ Lx{ c s )<^\u\- 2 . 

When q > 1, we can proceed in the same manner. This completes the proof. □ 



5.2. Existence in Region 3. To show the existence of solution for (1.1), we have to solve a 
small data problem with data prescribed on Cg and C+. The data on Cg is induced from the 
solution in Region 2 and the smallness of 5 leads to the smallness of the data; the data on Ci is 
simply an extension by zero of the short pulse data prescribed on C uo , since Lip and all higher 



order derivatives of <p on Sg Uo are small (we have seen this in the proof of Proposition 5.1 ), the 



data in on C+ are also small. 
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We now prove a theorem similar to the classical small data results [6] and [7] of Klainerman. 
The approach we are going to use is inspired by the harmonic gauge based proof of nonlinear 
stability of Minkowski space-time from |lUj and of Lindblad and Rodnianski. Since all the 
arguments are more or less well-known and scattered in the literatures, we only sketch the key 
estimates. 

The following picture will be helpful for the structure of the proof: 



Region 3 




Let r denote one of the following vector fields: 



d d 

r G { — , Q i, S\i = 1, 2, 3}, 



where 



n 



'.i 



-n=x — 



_d_ 

dxi ' 



n _n 9 9 

ilQi — \ liQ — t— h Xi — , 

OXi ot 

r, 9 d 
S = t— + r— =uL + uL. 
ot or 

For cp, we define the fc-th conformal energy on Y>t as follows 



lal</c St i=l 



\a\<k 

where S T = {t = r} n Region 2. 

We also use deft to denote all possible derivatives, i.e. dej) £ {)P(ft, Lcf>, L4>}; we use ■ 
good derivatives, i.e. d(j) £ {y<P-,L ( t ) }- 

Therefore, the classical Klainerman-Sobolev inequalities implies 

< \ T?( con ^(A\ 

(1 + |u|)(l + |u|)a 



to denote 



When one commutes T's with (1.1) k times (k=8 suffices), we have 



(5.3) 
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where 

F k = Yl Q(vrVvr», 

\P\+h\<k 

where the Q's are null forms and we have ignored the irrelevant numerical constants. 

Once the following a priori estimates have been established, the rest of the proof will be 
routine. So we only give the details for the following estimates: 



Proposition 5.3. For all k < 8, assume that the standard energy fluxes on for (5.3) are all 
bounded above by e where e is a small positive constant. We also assume that such a solution 
of (1.1) exists in T>t where u$ + 5 < t < — 1 and T>t is the region below £f. If e is sufficiently 
small, there is a universal constant Co, such that 

Y sup E[ con H^){r)<Co-e. 
k=o r e[uo+<M] 

d 

Proof. First of all, using Killing vector field — in T>t, we have the following standard energy 

dt 

estimates: 

I |a t rVl 2 + ^l^rVl 2 <e 2 + 2 f [ \F k \\d t r k <p\. (5.4) 

Secondly, let C ut is the part of C_ u in V t . We can then apply the standard energy estimates in 
the region bounded by C ut , C_$, E t and C+ , we have 

"t 



[ |arVl 2 <e 2 + 2 [ [ \F k \\d t r k <p\. 

JC,. t Ju()+8 J E T 



1 

(i + N 



We multiply the above inequality by — — — 1+k with k > and integrate over u G [5, t — uq] , 



this yields 

|5rVl 2 _ r |5rVl 2 
/ D t (i + N) 1+K ~ J s Jc ut (i + N) 1+K 



ft 

<e 2 + 



f [ \F k \\d t r k <p\. 

Juo+8 JY.T 



Combined with (5.4), we obtain the following estimates which will serve as the main tool for the 
rest of the proof: 

W |ar>| 2 +W/ 'fTf ' 2 +K < . 2 + E r / \md t r^\. (5.5) 

| Q |< 8 ^St \ a \<8 JJT>t { m) \ a \<8,k<8 Juo+5J ^ 

We now use a bootstrap argument and we make the following bootstrap assumptions: 



£ sup 4 conf) (^)(T)<M. e . 

k—0 re[uo+8,t] 



where M is a large constant. 
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Since F^s are linear combinations of null forms, the nonlinear terms on the right hand side 
of (5.5) can be bounded by N\ + N 2 where 



Ni= J2 II Idr^pr^H^rVi, 

|/3| + |7|<8,fc<8,|7|<4^" 0+ ' 5 

N 2 = f I \dT^\\dY^\\d t Y a ip\. 



1 + l7l<8,fc<8,|/3|<4 

The control of N\ is slightly easier, we use Klainerman-Sobolev inequalities for |<9r 7 9j|. Most 
importantly, recall that the Klainerman-Sobolev inequalities improve a factor of — j- for good 

derivatives. This yields 



< 



H7|<8,fc<8,| 7 |<4- ?t0+5 M 2 ^ 



|/3|+|7|<8,fc<8,|7|<4 

< M 3 • e 3 . 



uo+5 \U 2 



To control A^2, we would like to use the second flux term in (5.5). To this end, we first use 
Cauchy-Schwarz inequality: 

iv 2 <I y t ' ar >' 2 

l/3| + l7l<8,fc<8,|/3|<4 A o+5 U + l«U 

+ c £ / / (i + N) 1+K |arVl 2 l^r>| 2 

|/3| + |7|<8,fc<8,|^|<4 Ju o+ & 



We can choose a large C so that the first term can be absorbed by the left hand side of (5.5); for 
the second term, we can use Klainerman-Sobolev inequalities to control |<9r^</?| 2 and the energy 
norms to control the rest, this yields 

N 2 < M 4 • e\ 



We put all these estimates back to (5.5), this yields 



y I \dT a <p\ 2 <e 2 + M 3 -e 3 + M 4 -e 4 . 



M<8' 

We can then take a sufficiently small e to close the argument. □ 

The parameter e is proportional to 5* . Let 5 be sufficiently small, therefore, we have con- 
structed a solution in Region 3. 
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5.3. Existence from Past Null Infinity. We will let uq go to — oo so that the null hypersur- 
face C U(i will approximate the past null infinity. 

We choose a decreasing sequence {uo,i} in such a way that «o,j — > — oo and we solve Goursat 
problem for (1.1) with initial data at C Uo i . We emphasize that the choice of the seed data ipo 
is the same for all uo/s. For each uo t i, we obtain a unique smooth solution ipi defined in the 
region T>i where 

T>i = {pe R 3+1 \t(p) < -l,u(p) > u ,i}. 

Moreover, by Sobolev inequalities, for all k < 8, there exists a constant Co independent of i, so 
that 

llVillc*^) < Co- 
Thanks to the lemma of Arzela-Ascoli, we could extract a subsequence of solutions, still denoted 
by {ifi}, converges uniformly in any compact set of {(t,x) S IR 3+1 |£ < —1}. We denote the limit 
to be cp and this is a classical solution of (1.1) all the way up to past null infinity. 

To prove the Main Theorem 2 of the paper (it implies the Main Theorem 1 in a straight- 
forward way), it remains to show the uniqueness from past null infinity. Suppose ip and 4> were 
two classical solutions for (1.1) with the same scattering data (1.5), therefore, 

D(<p-4>) = Q(V<p, Vy>) - Q(V<p, V<p) = F(V<p, V<f>), (5.6) 

with 



For r < -1, let 



lim \u (ip - 4>)\ u =u = 0. (5.7) 

UQ— > — oo 



E(r) = / \dt(<p - + XU\di(<P ~ 4>)f, 

Jt=T 



the condition (5.7) reads as 



lim E(t) = 0, 
t— >— oo 



i.e. E(t) = o(l). According to the standard energy estimates, we have 

E(t)<E(2t)+ I (f \F(Vv,V<t>)\\dt(<p-<l>)\)dT. (5.8) 

We observe that at least factor in the quadratic form F(<p, (j)) is a good term, i.e. it is either 
dip, dcj> or d((p — 4>). We use L°° bound for this term and it is bounded by 0(|t|~2). Therefore, 
(5.8) becomes 

E{t) < E(2t) + C / \t\-2E(t)<It 

< E(2t) + C \t\-t sup E(t) (5.9) 

re[2t,t] 



< E(2t) + C$\t\ 
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where Co is the size of E(t), which is actually small. We can iterate this estimate to derive 
(notice that the following o(l) is independent of k) 



k-l 

E{t)<E(2 k -t) + C^2-i)Cl\t\- l 2, V fc€Z> . 

j=0 

We then let k — > oo and we improve the decay E{t) = o(l) to be 



E{t)< C ° t (5.10) 
1 - 2-2 



We then substitute (5.10) into (5.9) to iterate again, we obtain 

E{t)<E(2t)+ ° 3 ° \t\~\ 
2(1 - 2 _ 3) 

We repeat the above dyadic iteration and we can further improve the decay of E(t) to be 

E (t) < -^\t\-\ 

1 - 2"2 



We the repeat the whole procedure several times and we can obtain, for all k G 7L 



C; 



3+fc 



m<— ,;° — I*'- 1 



a-2-t)nj=oii+ii 

Let fc —7- oo, it implies E(t) = 0. Therefore, we have proved the uniqueness. 

5.4. Proof of Main Theorem 3. It remains to prove the Main Theorem 3 and this is an 
easy consequence of the estim ates derived previously. We first recall that no is a fixed number. 
From the proof of Proposition 5.1, we know that, for all 9 G S 2 , u G [0,5] and u G [no, —1], we 
have 

i 



\Lip(u,u,9) - L(p(u ,u,9)\ < 62 
\f<p(u,u,9)-f<p(u ,u,9)\ <5^ 
\Lip(u,u,9) - L(p(u ,u,9)\ < 55 



Since we take the short pulse data localized in B^i (9$), we simply integrate the above inequalities 
and use the pointwise information of the initial data on C uo , this shows immediately 

J C u —C° 



[ \Lv\ 2 + \fv\ 2 - I |L^| 2 + |^| 2 |<5. 
Jc° Jc° 



Moreover, on C_$, we have seen that almost no energy radiating from it, in terms of estimates, 
this means E^u^S) < 5?. This demonstrates the strong focused phenomenon and completes 
the proof of the Main Theorem 3. 
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